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ABSTRACT 

Examining a sample of massive galaxies at 1 .4 < z < 2.5 with Kvega < 22 from the Great Observatories Origins 
Deep Survey, we compare photometry from Spitzer at mid- and far-IR, to submillimeter, radio and rest-frame 
ultraviolet wavelengths, to test the agreement between different tracers of star formation rates (SFRs) and to 
explore the implications for galaxy assembly. For z^2 galaxies with moderate luminosities (Lg^,,, < lO'^L;^,), 
we find that the SFR can be estimated consistently from the multiwavelength data based on local luminosity 
correlations. However, 20-30% of massive galaxies, and nearly all those with Lg^^,,, > lO'^L©; show a mid-IR 
excess which is likely due to the presence of obscured active nuclei, as shown in a companion paper There is 
a tight and roughly linear correlation between stellar mass and SFR for 24 /xm-detected galaxies. For a given 
mass, the SFR at z = 2 was larger by a factor of ~ 4 and ^ 30 relative to that in star forming galaxies at z = 1 
and z = 0, respectively. Typical ultraluminous infrared galaxies (ULIRGs) at z = 2 are relatively 'transparent' 
to ultraviolet light, and their activity is long lived ( 400 Myr), unlike that in local ULIRGs and high redshift 
submillimeter-selected galaxies. ULIRGs are the common mode of star formation in massive galaxies at z = 2, 
and the high duty cycle suggests that major mergers are not the dominant trigger for this activity. Current 
galaxy formation models underpredict the normalization of the mass-SFR correlation by about a factor of 4, 
and the space density of ULIRGs by an order of magnitude, but give better agreement for z > 1 .4 quiescent 
galaxies. 
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L INTRODUCTION 

Tracing the buildup of galaxies at high redshifts requires 
reliable measurements of two fundamental quantities: stellar 
masses and star-formation rates (SFR). The integrated history 
of star-formation rates within galaxies provides a lower limit 
to their stellar mass growth. Stellar mass growth in excess of 
this contribution is to be ascribed to galaxy mergers. Most of 
the rise in the space density of massive galaxies takes place in 
the cosmic time interval between z 3 and z 1 (e.g., Dick- 
inson et al. 2003; Rudnick et al. 2003; 2006; Fontana et al. 
2004; 2006; Daddi et al. 2004a, D04a hereafter; van Dokkum 
2006; Franceschini et al. 2006; Pozzetti et al. 2007; Arnouts 
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et al. 2007). It is still unclear, however, what is the major 
channel for galaxy growth during this period, and the quan- 
titative role of mergers in building up the mass of galaxies. 
Violent starbursts have been found to be common in the z ~ 2 
universe (D04a, Daddi et al. 2005b, D05b hereafter; Shapley 
et al. 2005; Forster-Schreiber et al. 2004a), but their relevance 
on the mass growth of galaxies critically depends on the ef- 
fective duration of these vigorous star-formation events (see 
D05b). To make substantial progress one needs to look at the 
distribution of star-formation rates in distant galaxies, in or- 
der to ascertain the relative frequency of these events among 
massive galaxies. This requires accurate measurements of in- 
dividual star-formation rates in large samples of distant galax- 
ies. 

The advent of the Spitzer Space Telescope holds substantial 
promise to achieve these goals, as it now routinely allows one 
to detect distant galaxies with the Multiband Imaging Pho- 
tometer for Spitzer (MIPS) at 24 /im all the way to redshift of 
~ 3 (and even beyond). This corresponds to the rest-frame 
spectral mid-IR region (5-I2/im), which tends to be dom- 
inated by polycyclic aromatic hydrocarbon features (PAH). 
In the local universe, the mid-IR luminosities of galaxies are 
known to correlate with the total infrared luminosities (Lir; 
see Chary & Elbaz 2001; Elbaz et al. 2002; Dale & Helou 
2002; Forster-Schreiber et al. 2004b), a well established probe 
of the star-formation rate of galaxies. Problems however ex- 
ist because the PAH emission is known to be reduced at low 
metallicities (Engelbracht et al. 2005; Madden et al. 2006). 
Spitzer observations show a substantial scatter in the local 
correlation of mid-IR luminosity vs. the total infrared lumi- 
nosity LiR (Dale et al. 2005; Smith et al. 2006; Armus et al. 
2006). In addition, a substantial contribution to the mid-IR 
light can come from the accretion disks of Active Galactic 
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Nuclei (AGNs). AGNs in distant galaxies can be identified by 
their X-ray emission, but this would not be detectable at z = 2 
in the case of large column densities of obscuring material 
even for luminous AGN (e.g., as in the case of NGC 1068). 

Several recent surveys have used the 24 /im emission in 
1 < z < 3 galaxies in order to estimate their ongoing star- 
formation rates and related quantities (e.g., Perez-Gonzalez 
et al. 2005; Reddy et al. 2005; D05b, Papovich et al. 2006; 
Caputi et al. 2006ab; etc.). This is generally accomplished by 
using prescriptions relating the mid-IR luminosity to Lir, as 
calibrated in the spectral libraries from Chary & Elbaz (2001; 
CEOl hereafter) and Dale & Helou (2002; DH02 hereafter), 
although sometimes other empirical conversions have been 
used (see, e.g., Caputi et al. 2007). However, it remains to 
be established to which extent these estimates are still accu- 
rate for z ^ 2 galaxies. For example, the impact of the sili- 
cate 9.7 /im absorption feature at high redshift has still to be 
quantified, and the relative strength of the PAH emission as a 
function of star formation activity might be different at high 
redshift with respect to the local Universe. 

Using multi-wavelength data from the Northern field of the 
Great Observatories Origins Deep Survey (GOODS), D05b 
showed that the average SED of massive z — 2 galaxies is 
consistent with that of present-day ULIRGs. This implies, on 
average, a good agreement between star-formation tracers as 
calibrated locally and based on the radio, far-lR, mid-lR and 
even UV (corrected for the extinction by dust) and the X-ray. 
This agreement is encouraging in view of our need to estimate 
the star-formation rates in distant galaxies, especially given 
the currently limitations in probing the luminosities of galax- 
ies at z ^ 2 at the peak of the far-lR emission at ^ 60 — 100 /im 
in the rest-frame. In this regard, the situation will eventually 
improve with Herschel, and later on with ALMA. In D05b a 
number of interesting features of high redshifts ULIRGs were 
discussed, e.g., that they may have fairly long duty cycles and 
that they appear to be relatively 'transparent' to the UV pho- 
tons. These and other aspects are visited again here in more 
detail. 

The main aim of this paper is to verify to which extent this 
agreement among the various star formation tracers persists 
individually for distant massive galaxies, hence to which ex- 
tent SFRs can be unambiguously measured at high redshift. In 
particular, reliable SFRs are instrumental for relating to them 
other galaxy properties in a meaningful way. For example, re- 
cent surveys have shown that a tight correlation exists at z = 
and at z = 1 between the stellar mass and the SFRs of blue, 
actively star forming galaxies (Noeske et al. 2007; Elbaz et al. 
2007). It is not yet clear if such a fundamental correlation ex- 
ists also at higher redshifts, when most of the stars in massive 
galaxies were formed. 

For this work we make use of some of the deepest multi- 
wavelength datasets, available as part of GOODS. In particu- 
lar, compared to D05b, we use here a much deeper 1 .4 GHz 
dataset over GOODS-North (GOODS-N hereafter) obtained 
with the Very Large Array (VLA) (Morrison et al. in prepa- 
ration), and 70 /im photometry from both GOODS fields 
from Frayer et al. (2006). Also crucial to this work is the 
large dataset of spectroscopic redshifts that is available to 
us, including the VLT/F0RS2 spectroscopic redshifts for the 
GOODS-South field (GOODS-S hereafter; Vanzella et al. 
2005, 2006) and those from the ultradeep spectroscopy of the 
GMASS survey (Kurk et al., in preparation). Finally, we have 
used in this work the latest realization of galaxy formation 
models based on the Millennium simulations (Springel et al. 



2005), in the form of mock lightcones presented by Kitzbich- 
ler & White (2007). This allows us to constrain the models 
and gain insights into the nature of star forming galaxies at 
high redshifts. 

The paper is organized as follows. The sample selection of 
z 2 galaxies, with spectroscopic and photometric redshifts 
are presented in Section|2] The multiwavelength datasets used 
are presented in Section|3] with a discussion of known limita- 
tions of each dataset as a star formation indicator, especially 
for high redshift galaxies. Section |4]compares the 8 /im rest- 
frame luminosities of galaxies (estimated from their 24 /im 
flux densities) to those at other wavelengths, while Section |5] 
focuses on the use of ultraviolet (UV) luminosities and on 
the reliability of extinction corrections. A suggested recipe 
for estimating SFRs in z = 2 galaxy samples is given in Sec- 
tion |6] We present the implications of our findings for the 
characterization of star formation in distant galaxies, galaxy 
growth and the comparison with theoretical models in Sec- 
tion |7] Our conclusions are summarized in Section |8] We as- 
sume a Salpeter initial mass function (IMF) from 0. 1 and 100 
Mp, and a WMAP3 cosmology with 17a, Om = 0.76,0.24, 
and h = //o[km s"' Mpc^ij/lOO = 0.73 (Spergel et al. 2007). 
Throughout the paper we use magnitudes in the Vega scale, 
unless stated otherwise. 

2. SAMPLES OF z ~ 2 GALAXIES IN GOODS 

The sample of galaxies analyzed in this study has been se- 
lected in the /T-band, and applying the BzK selection tech- 
nique introduced by Daddi et al. (2004b; D04b hereafter), 
that allows us to define highly complete samples of galaxies 
in the redshift range 1.4 < z < 2.5. Unless explicitly stated 
otherwise, we limit the analysis to BzK star forming galaxies 
(or sBzKs, as opposed to the passive ones or pBzKs), as we 
aim to focus on galaxies with active star formation. In the 
GOODS-N field we have selected 273 galaxies to K < 20.5, 
corresponding to the 5-sigma detection limit of the data taken 
with the Flamingos camera on the Mayall 4m NOAO tele- 
scope (the dataset used here is slightly deeper than that used 
in D05b). In the GOODS-S field we selected 1018 sources 
detected with S/N > 5 down to K < 22. The GOODS-S field 
A'-band VLT ISAAC data is not homogeneously deep over the 
whole field, but reasonably complete over the whole used area 
at these K-hwA limits. We have excluded from the analysis 
all hard X-ray detected galaxies in both GOODS fields, us- 
ing the catalogs of Alexander et al. (2003), and galaxies with 
power law SEDs over the 3.6-24 /im wavelength range ob- 
served by the Spitzer Infrared Array Camera (IRAC) as likely 
AGN-dominated sources. Only a handful of power-law galax- 
ies remain undetected in our deep X-ray imaging data. Also, 
in both fields we only consider galaxies whose IRAC coun- 
terpart is closer than 0.5" from the K-band position. We have 
empirically verified that this criterion is efficient at excluding 
sources where the IRAC bands photometry is substantially 
contaminated by blending due to the relatively poor IRAC 
PSF of about 1.6". Statistically, this excludes some 10% of 
the galaxies, for the accurate IRAC detection procedures car- 
ried out in GOODS and based on mexhat kernel detections 
using SExtractor (Dickinson et al., in preparation). This crite- 
rion is required in order to have reliable photometry over the 
IRAC bands (hence solid photometric redshift estimates), and 
to avoid problems with blending in the MIPS 24 /im photome- 
try. The 24 /im flux densities (Chary et al., in preparation) are 
in fact estimated using prior source positions from the IRAC 
catalogs. 
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Spectroscopic Redshift Spectroscopic Redshift 

Fig. 1. — Comparison of spectroscopic and photometric redshift for galaxies in GOODS-N (left panel) and in GOODS-S (right panel). The GOODS-S 
photometric redshifts that we use are taken from Grazian et al. (2006), and we show in the right panel of this figure only those galaxies that did not already have 
a spectroscopic redshift in Grazian et al. (2006). 




Redshift Redshift 

Fig. 2. — The redshift distribution of the z ~ 2 sample of galaxies considered in this paper for GOODS-N (left panel) and GOODS-S (right panel). SoUd 
histogram refer to spectroscopic redshifts, while the empty histogram is for photometric redshifts for BzK selected star-forming galaxies. 



In both fields we used multiband photometry in the optical 
and near-IR (see Giavalisco et al. 2004 for datasets descrip- 
tions), updated to the latest releases and observations. SEDs 
were derived using the method described, e.g., in D04b and 
D05b. Photometric redshifts were determined in both fields 
for all BzK selected z ~ 2 galaxies by fitting to the empirical 
templates of Coleman et al. (1980), as described in D04ab. 
However, for the GOODS-S field we find that the photomet- 
ric redshifts measured by Grazian et al. (2006), which include 
deep U-band data similarly to what we have in GOODS-N, 



are in sUghtly better agreement with the spectroscopic red- 
shifts and we decided to adopt them for the present analysis. 

Spectroscopic redshifts are available in GOODS for a fairly 
representative fraction of the galaxy sample considered in this 
paper. When limiting to spectroscopic redshifts with flags 
warranting reUabiUty, redshifts are available for 175 galaxies 
in GOODS-S (17% of the total sample) and 57 in GOODS-N 
(21% of the sample). For GOODS-S, in addition to all sources 
of publicly available redshifts (VVDS: Le Fevre et al. 2004; 
GOODS/FORS2: Vanzella et al. 2005, 2006; K20 survey: 
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Mignoli et al. 2005; CDFS survey: Szokoly et al. 2004) we 
have most notably used results of redshift surveys developed 
as pait of GOODS or in collaboration with GOODS, that ex- 
plicitly targeted the z^2 galaxies. This includes the Galaxy 
Mass Assembly Spectroscopic Survey (Kurk et al. 2006, and 
in preparation'^) that targeted Zphot > 1-4 galaxies selected 
at 4.5/im (IRAC channel 2) to a limiting AB magnitude = 
23. For the GOODS-N field major sources of z > 1 .4 spectro- 
scopic redshifts are Keck LR-blue and DEIMOS observations 
carried out within GOODS (Stern et al., in preparation) and 
the BM/BX samples of Reddy et al. (2006b), in addition to 
the TKRS survey (Wirth et al. 2004). 

Fig.[T]panels show the comparison of photometric and spec- 
troscopic redshifts for our samples of BzK selected galaxies. 
The dispersion of the residual in the difference between spec- 
troscopic and photometric redshifts is a{zspec ~Zphot) ^ 0.25 
for GOODS-N and GOODS-S. The larger number of outliers 
in the full samples, with respect to the BzK samples, shows 
that the use of the BzK selection does help in defining a clean 
sample of z ^ 2 galaxies, which is substantially free from 
lower redshift contaminants. We have identified a small sam- 
ple of galaxies in both fields with strong discrepancies be- 
tween spectroscopic and photometric redshifts. From a care- 
ful analysis of the multiwavelength SED of these outliers, we 
found that in most cases the spectroscopic redshift appears to 
be the wrong one, and we have adopted the photometric red- 
shift for the analysis. This does not affect any of our major 
conclusions. 

Fig. |2] panels show the redshift distribution of the sources 
used for the analysis of z 2 SFR indicators in this paper We 
do not include in the analysis carried out in the next sections 
the objects in the tails of the distributions, with either z < 1 .2 
orz > 3. 
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Fig. 3. — The limiting Ljr that can be detected with the various tracers used 
in this work. We consider limits of 14. 1 /^Jy (3(t) for radio 1.4GHz, 12/jJy 
(3(t) for MIPS 24 fim, 1.8 mJy for MIPS 70 ^m (3(7) and 4mJy for 850 ^m 
(flux density of typical detections, Pope et al. 2006, some small regions in 
GOODS-N have actually somewhat fainter detections). We use the model 
of CEOl to convert flux densities into Ljr as a function of redshift, except 
for radio where we use the radio-IR correlation. For the UV. the effective 
SFR limit in our samples (formally converted to Ljr using Eq. 1) is given in 
principle by the depth of the B-band imaging (B = 27 AB, 3cr), from which 
we derive the UV luminosity. In case of non-zero dust reddening the reached 
SFR limit is less deep. MIPS 24 fj,m is by far the deepest long wavelength 
probe of SFR and IR luminosities at these high redshifts. 



3. MULTIWAVELENGTH DATASETS, THEIR INTERPRETATION AND 
KNOWN LIMITATIONS 

A major focus of this paper is the comparison between lu- 
minosities at various wavelengths, each of which is known to 
correlate with the SFR in galaxies. In order to put emission 
at different wavelengths onto a common footing for intercom- 
parison, we will adopt nominal conversions between these ob- 
served luminosities and fiducial quantities such as the total 
infrared luminosity (Lir, integrated from 8 to 1000/im) or an 
equivalent star formation rate (SFR). The use of Lir as a star 
formation indicator implies that all energy from star formation 
is absorbed by dust and re-radiated in the thermal infrared, 
and that there are no other significant sources for dust heating. 
Neither assumption is universally correct: on the one hand not 
all UV photons from young massive stars may be absorbed by 
dust, whereas emission from an AGN can boost emission at 
various wavelengths, including the infrared luminosity Ljr. 
Thus, SFR estimates based only on Lir would result in un- 
derestimates in the former case, and in overestimates of the 
true SFR in the latter. Starburst galaxies with the high SFRs 
(many tens to hundreds of Mr.-, yr^') and infrared luminosi- 
ties (Lir > 1O''L0 are called luminous infrared galaxies or 
LIRGs; then, for Lir > lO'^Lp one refers to ultraluminous in- 
frared galaxies, or ULIRGs; we will also refer to objects with 
LiR > lO'^Lf) as hyperluminous, or HyLIRGs) characteristic 
of some the z '--^ 2 galaxies studied here. Values for Lir and the 
SFR may differ substantially, depending on the specific tracer 
used to evaluate them. In this section, we present the different 

see |http://www.arcetri. as tro.it/cimatti/gmass/gmass .html I 



indicators, and give a description of the techniques used to an- 
alyze the data, and of the recipes used to convert the observed 
emission to SFRs. These recipes are generally based on em- 
pirical calibrations in the local universe. In the next section, 
we will compare the derived equivalent Lir or SFR values, 
looking for agreement, which would suggest coherent behav- 
ior from low to high redshift, or for discrepancies, which may 
suggest that other effects are coming into play. 

Following Kennicutt (1998), we may express the conver- 
sion between Lir and SFR as: 

SFRiR[M0yr-i] = 1.73 X 10-^%^[Lq]. (1) 

This conversion depends on the adopted IMF; here we have 
adopted the IMF of Salpeter (1955). Given this relation, 
SFRiR and Lir can be seen as equivalent quantities for star 
forming galaxies. In order to estimate the total SFR, we 
must also include the contribution of energetic emission from 
young, massive stars that is not absorbed and reprocessed by 
dust. We estimate it from the rest-frame UV luminosity as in 
Eq. 5 of D04b, with no reddening correction, so to have: 

SFRtot = SFRiR + SFRuv^uncorrected- (2) 

We shall return to the conversion of UV light to SFR in Sec- 
tion 3.6. The second term above is negligible for red galaxies, 
as for most of the Bz^T-selected z 2 sources with K ^ 20, 
but becomes more relevant at fainter magnitudes when bluer 
galaxies are more prevalent. 
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Fig. 4. — The detection rate at 24 [oa of the z ~ 2 galaxies in our sample as 
a function of the K-band limiting magnitude (cumulative fractions), for the 
GOODS-S (solid hne) and GOODS-N (dotted line) fields. Red lines are for 
a detection limit of 100 /^Jy, matched by most shallower surveys performed 
at 24 [im with Spitzer. The very large detection fraction in GOODS up to 
K = Tl 'vs, in line with the expectations based on Fig.|3] 



3.1. Spitzer MIPS 24 fim imaging 

Both GOODS-N and S have been observed with Spitzer 
MIPS at 24 fim for about 10 hours per sky pixels. See D05b 
for a slightly more detailed description of these datasets and 
for the associations of 24 /im flux densities to K-selected 
galaxies. Catalogs for the 24 /im sources will be presented by 
Chary et al. (in preparation). For a z = 2 galaxy the 24 /im 
emission corresponds to approximately an 8 rest-frame 
measurement. For each galaxy in the 1.4 ^ z 2.5 redshift 
range we have derived bolometric infrared luminosities from 
the observed flux density at 24 /im and using the luminosity 
dependent SED libraries of Chary & Elbaz (2001) and Dale 
& Helou (2002). In practice, this is done by interpolating Ljr 
over the template SEDs, sorting the value of Ljr that corre- 
sponds to the observed 24 fim flux density. This determination 
of the bolometric IR luminosity implies a large extrapolation, 
as the ^ 8 /im rest-frame luminosity is generally a small frac- 
tion of LiR, and therefore the result is template-dependent. For 
this reason we also derived more direct 8 /im rest-frame lumi- 
nosities from the 24 pim flux density, using the model SEDs 
(from CEOl and DH02) only to K-correct the measurement 
from 24 fim observed to 8 fim rest-frame luminosities (this K- 
correction is also template dependent but generally small in 
the probed redshift range, within a factor of 2). For the CEOl 
templates the relations between L(8 /im) and Lir is: 

Log{^)^ 1.50 xLog{^)-A3l (3) 

for Log {Lsf,m/LQ) > 9.75, and 

Logi^)^0.93xLogi^) + 1.23 (4) 



for Log {L$fj,,„/LQ,) < 9.75; while fortheDH02 templates it is: 
Log{^) = 1.21 xLogi^)- 1.25. (5) 

Lq Lq 

Here, and in the remainder of the paper, L{^p,m) is lyLi, at 
8/fm. The two relations give the same Ljr ^ 5 x 1O"L0 for 
Lf,fj,m ^ 5 X lO'^L©, For higher Lg^m the CEOl templates re- 
sult in increasingly larger values of Lir, up to a factor of 2.3 
for Ls^in = 10^2^0- At Lg^,„ < 5 x lO^L© the CEOl tem- 
plates results in lower Lir values with respect to DH02. For 
the range of interest in this work, the differences are always 
well within a factor of 2, also due to the two-slope dependence 
of the CEOl conversion. 

3.2. Spitzer MIPS 70 pm imaging 

The 70 pm data used here are a combination of all of the 
GOODS-N and GOODS-S available to us from the Spitzer 
programs lD-3325, -20147, -81, and -30948. The data were 
reduced following the techniques given in Frayer et al. (2006). 
These data represent 3 hours of integration over the full 
GOODS-N field and the central (10' x 10') region of GOODS- 
S. The outer regions of GOODS-S have an integration time of 
about 2 hours. The typical 3a limit of these data is 1.8mJy 
at 70 /im in the deepest regions. Only 7 BzK star forming 
z ~ 2 galaxies are individually detected at 70 /xm (6 of which 
are in GOODS-N, a likely effect of small number statistics). 
Therefore, we used stacking techniques to measure average 
properties of galaxies at 70 pm. Bright low-redshift sources 
were removed from the data to avoid biasing the stacking re- 
sults. We convert the 70 pm flux densities into total IR lumi- 
nosities again using the CEOl and DH02 template libraries. 
Being closer to the peak of IR SEDs, the conversion of ob- 
served 70 /im flux densities into Ljr is much less critical than 
for 24 /im, even at z ~ 2. 

3.3. VIA 1 .4 GHz radio imaging 

For this work, we use new ultradeep 20 cm ( 1 .4 GHz) imag- 
ing of the GOODS-N obtained at the VLA in various array 
configurations for maximizing the data quality and the dy- 
namic range (Morrison et al., in preparation). Coadded to the 
previous radio data (mostly from Richards et al. 2000), the 
new data currently reach an rms of 4.5 /iJy per beam over the 
field. We then assume a radio spectral index a — —0.8 for 
deriving 1 .4 GHz rest-frame radio luminosities from the ob- 
served flux density. The local radio-lR correlation (Condon et 
al. 1992; Yun, Reddy & Condon 2001) is used to estimate the 
bolometric IR luminosity from the radio luminosity: 

Lir/L0 = 3.5 X 10-^2^1.4 G//z) [W//z"'] (6) 

Using radio luminosities to estimate Ljr for our sources im- 
plies assuming that the radio-IR correlation still holds at high 
redshift. The dispersion of the local radio-lR correlation can 
reach a factor of 2 for the most luminous sources (Yun et al. 
2001), with luminosities comparable to those of the objects 
detectable at z ^ 2. The introduction of a K-correction term, 
to derive 1 .4 GHz rest-frame flux densities from the observed 
ones, can introduce extra noise (or a bias) if the radio slope is 
different from the assumed value. This effect is likely smaller 
than the intrinsic dispersion seen locally, as a 10% error in 
the slope translate in a similar error in the intrinsic 1 .4 GHz 
rest-frame flux density and therefore in the estimated Ljr. 

In order to reach deeper radio flux densities, we stacked in- 
dividuaUy undetected sources using the following technique. 
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Fig. 5.— Spitzer MIPS 24 fim flux density for galaxies with K < 20.5 in the GOODS-N field (left panel) and those with K < 22.0 in the GOODS-S field (right 
panel). Black filled symbols are for galaxies with known spectroscopic redshift, while red empty symbols are photometric redshifts. The tracks show expected 
24 fj,m flux densities for galaxies with total IR luminosity of lO" , 10'^ , lO'^'L,., (LIRGs, ULIRGs, HyLIRGs). based on the model SEDs of Chary and Elbaz 
(2001). 



We measured the average radio flux density level in a 1" ra- 
dius aperture and correlated this measure to the total radio flux 
density using simulated radio sources of various flux densi- 
ties (defining an aperture correction, about a factor of 1.4). 
By measuring a large number of empty regions in the radio 
image we estimated the rms fluctuations of this measure in 
order to associate an error to each measure. This allows us 
to recover radio flux densities of ultra-faint sources with an 
rms of 4.7 /iJy per source, very close to the noise per beam. 
We used a 1" radius aperture, very close to the radio beam 
size, after testing a variety of apertures from 0.5" to 2", as it 
provides the best compromise between final S/N and the ne- 
cessity to measure over a relatively large area in order to cope 
with possible coordinate misalignments. To statistically ac- 
count for the coordinate mismatches (of order of 0.1-0.2"). 
By applying this procedure to bright, individually detected 
galaxies based on the K-band prior position, and comparing 
the result to radio flux densities derived by direct fit of the 
profiles, we found that the radio flux densities required an av- 
erage additional correction of 10%. All these measurements 
and errors take into account the primary beam correction for 
off-axis measurements. For obtaining stacking measurements 
of undetected sources we obtained weighted averages of the 
flux densities measured in this way. 

3.4. SCUBA 850 mutn imaging 

The GOODS-N field has been observed with the Submil- 
limeter Common User Bolometer Array (SCUBA) at 850 fim 
in a variety of programs, whose data have been merged to- 
gether into a so-called super-map by Borys et al. (2003) and 
Pope et al. (2005). The depth of the submillimeter (submm) 
map is highly variable over the field. In D05b, we found that 
only one GOODS-N BzK galaxy with 7*: < 20 is individu- 
ally detected at 850/im in the catalog described in Pope et al. 
(2006). When considering BzK galaxies down to K < 20.5, as 



in the present work, we detect five objects at 850/im. We note 
again here that we are excluding all submm galaxies (SMGs) 
that are individually detected in hard X-rays. 

We obtained SCUBA 850 fim stacking of individually un- 
detected sources by estimating flux densities and errors from 
the super-map at the position of each program galaxy, and 
then proceeding as in D05b. 

Using sub-mm flux densities at 850 /im to infer bolometric 
luminosities (Ljr) relies on assumptions for the long wave- 
length shape of the SED, i.e., on dust temperatures. For this 
work, we continue to use the SED shape templates of CEOl 
and DH02, derived from local correlations. The /gso^^,,, to 
LiR conversion is to first approximation independent of red- 
shift for the 1.4 < z < 2.5 range explored here. However, 
the conversion is not perfectly linear According to the CEOl 
templates, a ULIRG (Lir ~ lO'^Lj ) would have a flux den- 
sity of about 1 mJy at these redshifts, raising to about 4.6 mJy 
for a HyLIRG (Ljr = lO'^L©)- DH02 templates give - 20% 
higher flux densities at a given Ljr, over the same regime 
of luminosities. The non linearity is due to the luminosity- 
temperature trend observed in the local Universe (less lumi- 
nous galaxies harbor colder dust). 

3.5. X-ray imaging 

We use the catalog of X-ray detected sources of Alexander 
et al. (2003). All sources detected in the hard X-ray band were 
excluded from the SFR analysis in this paper as likely contain- 
ing AGNs. SFRs can be derived from the soft band 0.5-2 ke V 
fluxes (mapping closely to the 2-8 keV rest-frame band) at 
z = 2 (see e.g., RanaUi et al. 2004, Persic et al. 2004, Horn- 
schemeier et al. 2005), provided that no substantial AGN con- 
tamination is present at those energies. We have performed 
X-ray stacking of BzK samples over the two fields. Results 
are presented in Paper II, as an assessment of the presence of 
AGNs is required for interpreting the X-ray stacking results. 



z — 2 star-formation in GOODS 



7 




Redshift Redshift 

Fig. 6. — The bolometric infrared luminosity of galaxies Lir estimated from the 24 ^lm flux densities using the templates of CEOl is plotted against redshift 
for K < 20.5 sources in GOODS-N (left panel) and GOODS-S (right panel). Symbols are as in Fig.|5] 



3.6. UV based SFR estimates 

Ongoing SFRs in z 2 galaxies are estimated using the UV 
luminosity, following the prescriptions in D04b. The 1500A 
rest-frame luminosity is derived from the observed B-band 
flux density, and a K-correction term is derived based on the 
redshift of each source (photometric or spectroscopic) and its 
UV spectral slope. Then L(1500A) is converted into a SFR 
using Eq. 5 of D04b, i.e.: 

SFR(MQyr-i) =Li5oo[erg s-1Hz-1]/(8.85 x 10^^). (7) 

The amount of dust reddening is estimated adopting a a 
Calzetti et al. (2000) reddening law (see also Meurer, Heck- 
man & Calzetti 1998), and using the relation from D04b: 

E{B-V)=Q.25{B-z + Q.I)ab. (8) 

An attenuation ofAi5oo = 10x£'(B — V) mag is then adopted 
to obtain dust-corrected SFRs from Eq. 8. With this proce- 
dure we obtain reddening corrected SFRs consistent to what 
derived with SED fitting (D04b) for the bulk of /^-selected 
z^2 galaxies. 

Clearly, this technique is affected by several limitations. In 
applying a reddening correction (which median is a factor of 
~ 40 for K < 20, z — 2 galaxies), the technique assumes that 
the UV spectral slope is entirely due to reddening, rather then 
to the presence of evolved stellar populations. Therefore we 
expect it should only work for actively star forming galaxies. 
Even more importantly, it is well known that strongest star- 
bursts are opaque to UV radiation and its total SFR activity 
cannot be reliably estimated solely from the UV, even after 
reddening corrections (e.g., Goldader et al. 2002; Buat et al. 
2005). Things can be different though at higher redshifts, as 
shown later in the paper (see also Flores et al. 2004; D05b; 
Reddy et al. 2005). 

3.7. IMF related effects 



It is worth mentioning to which extent the analysis pre- 
sented in this paper depends on the adopted IMF, i.e., a 
Salpeter (1955) IMF from 0.1 to 100 Mg. AH of the luminosi- 
ties that we use as SFR indicators are sensitive to the popula- 
tion of high mass stars ( ^ 5M0), but the bulk of the mass in 
stars that is actually formed is dominated by lower mass stars. 
Therefore, changing the slope of the low-mass end of the IMF, 
e.g., adopting the IMF of Kroupa (2001) or Chabrier (2003) 
that are in better agreement with current observations, would 
just imply lower overall SFRs, i.e., different normalizations in 
Eqs. 1 and 7 and different stellar masses (by nearly the same 
factor). We prefer to quote numbers for Salpeter IMF for con- 
sistency with much of the literature in the field. 

However, changing the intermediate or high-mass slope 
could have a significant effect on the relative conversions be- 
tween different SF indicators, since stars in different mass 
ranges can be responsible for varying contributions to the 
different SF indicators. For example, the radio emission is 
thought to be powered by higher mass stars than the IR or the 
UV emission (e.g., Condon et al. 1992), and a steeper IMF 
slope at the high masses would systematically increase the 
IR/UV over radio luminosities. Timescale issues, e.g., ob- 
serving a starburst very close to the beginning (Roussel et 
al. 2003) or a long time after the SFR peak, can also alter 
the luminosity ratios. Conceivably, IMF and star formation 
timescale are partly degenerate for some of the effects that we 
discuss later in the paper We neglect timescale effects in the 
following of the paper. 

3.8. Limiting depths and effect of photometric redshifts 

Fig. 13 shows the limiting sensitivity to bolometric IR lu- 
minosity from the various tracers using the recipes described 
in the previous sections (deep radio and SCUBA photometry 
are available only in GOODS-N). It is clear that the MIPS 
24 /im flux density is the most sensitive IR tracer by at least 
an order of magnitude, allowing in GOODS to detect galax- 
ies with Lir ^ 10' for most of the 1.4 < z < 2.5 redshift 
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range considered here. This is illustrated in Fig. |4l showing 
that the fraction of z ^ 2 galaxies in our sample detected at 
24 fim remains very high, over 60% even at the K <22 limit 
of the GOODS-S field. Even lower SFRs can in principle 
be derived from the UV over the whole redshift range, pro- 
vided there is no or just small dust obscuration. An unred- 
dened galaxy at z = 2 with B = 21 AB {3<t limit) would have 
SFR= 0.6 Mft yr^'. A more practical limit is given by the 
faintest galaxy detectable in the ^T-band {K = 22) having no 
dust reddening, which would have SFR'^ 10 yr^' for 
z = 2, similar to the depth reached at 24 /im. In the case 
of galaxies with non zero dust reddening we can measure 
the SFRs only above higher limits. For example, a galaxy 
with B = 21 and E{B -V) = 0.4 (the median value at the 
bright K<2Q limits) would have SFR - 25 Mq yr"' at z = 2 
(Fig. O. It is therefore quite interesting to compare UV and 
24 /im based SFR and Lir estimates, given that these are the 
most powerful tracers, and available for nearly all of the z ~ 2 
galaxies in our GOODS sample. 

The dependence of Lir with redshift is generally steep, and 
varying as a function of redshift itself, except for SCUBA 
850 /im photometry. In turn, photometric redshift errors prop- 
agate into Lir and SFR errors differently when different trac- 
ers are used. We note, however, that this is mitigated when 
considering the ratios of Lir derived from two independent 
tracers, as is generally done in this paper. Even in this case, 
however, one can commit significant errors in cases of catas- 
trophic redshift failures, e.g., if a galaxy at z < 2 is mistaken 
to be at z '--^ 3. In this case, the Lir derived from the 24 /im 
or 70 /im data could be strongly overestimated relative to that 
derived from, e.g., 20 cm data or from the UV, due to the 
stronger K-corrections for the emission in the MIPS band- 
passes. We take these effects into account when comparing 
the SFRs from different tracers. In general, the high quality 
of the adopted photometric redshift (see Fig. [U should limit 
this problem to a small minority of cases. In addition, we use 
the extensive spectroscopic redshift sample to consolidate any 
conclusion first derived using the full combination of spectro- 
scopic and photometric redshifts. 

4. IR BOLOMETRIC LUMINOSITIES FROM MID-IR, SUB-MM AND 

RADIO 

4. 1 . Bolometric luminosities from the 24 fxmflux densities 

Fig. |5] shows the flux density versus redshift relation of 
MIPS 24 /im detected sources in the two GOODS fields, to 
the corresponding A'-band limit in each field. Fig. |6] shows 
the bolometric IR luminosity inferred from the 24 /im flux 
densities using the CEOl templates. From both sets of fig- 
ures it is apparent that moving to higher redshifts sources with 
brighter and brighter Lir are found, implying (after account- 
ing for volume effects) a rapid brightening of the IR luminos- 
ity function (as already emphasized in the literature; e.g., Le 
Floc'h et al. 2005; Caputi et al. 2007). However, it is some- 
what surprising that the rise extends all the way to redshift 
z ~ 3, and sources with Lir ~ 10 '^Lq and even Lir ~ 10 ''^L© 
appear in the GOODS fields (see Peres Gonzales et al. 2004; 
Papovich et al. 2006; Caputi et al. 2006 for similar results). If 
due to star-formation, these extreme luminosities would im- 
ply obscured SFRs up to 10000 Mq yr^^. Such extremely 
high SFRs as inferred from 24 /im flux densities are some- 
what suspicious, suggesting that the mid-IR light may not be 
entirely powered by star formation, or may be reflect different 
intrinsic mid-IR properties of these z = 2 star forming galax- 
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Fig. 7. — Comparison, for ~ 2 galaxies in the GOODS-N field, of the 
total IR luminosity (Lir) as inferred from radio (using the local radio-LiR 
correlation; Condon 1992; Yun et al. 2001) to the 8 /jm rest-frame luminos- 
ity (derived from the 24 /jm observed flux density). Black and red symbols 
are as in Fig. [5] The green points show the results of stacking radio unde- 
tected sources in 3 bins of 8 /^m luminosity. The blue points show the average 
trend versus 8 /im luminosity, including both radio detected and undetected 
sources. The solid lines show the expected correlation based on the CEOl 
and DH02 templates. Hard X-ray detected AGNs are excluded. 



ies, with respect to local sources. 

4.2. MIPS 24 fim versus radio luminosities 

Bolometric luminosities derived from the 24 /tm flux den- 
sities are compared to those derived from radio in Fig.|7] for 
GOODS-N z ~ 2 galaxies. For radio detected sources (about 
21% of the total sample to K = 20.5) the average galaxy has 
Lir ^ 10 '^Lf) and there is overall agreement between 24 fim 
and radio derived luminosities, for both CEOl and DH02 tem- 
plates. However, a large scatter is present and there are galax- 
ies with 24 /im to radio inferred Lir ratios much different from 
unity in both directions. We will come back to these sources 
later in this section. For 94% (46/49) of the galaxies with 
L(8/iOT) > 2 X 10' ^Lg,, the radio detection (or limit) implies a 
higher Lir estimated from 24 /im with respect to radio, using 
either CEOl or DH02 templates. 

We used radio stacking in order to investigate trends for 
radio-undetected sources. Green symbols in Fig. |7] show 
the result for radio-undetected sources, stacked as a function 
of the 8 /im rest-frame luminosity, in turn derived from the 
24 /im flux density. Stacked flux densities are in the range of 
3-7 /iJy, comparable to or larger than the single beam noise, 
and the signal to noise of the detections is larger than 3 in all 
cases. The stacking emphasizes the existence of a substantial 
population of sources with L(8/im) > 2 x IO^'Lo where the 
Lir estimated from 24 /im exceeds on average by an order of 
magnitude the same quantity derived from radio. When con- 
sidering together radio detections and non-detections, we see 
a good agreement at low L(8/im) luminosities and an overall 
increasing discrepancy of higher L(8/im)/LiR with respect to 
both CEOl and DH02 templates. For L{8fim) > 2 x lO^L© 



z — 2 star-formation in GOODS 



9 



10 



L,„(24/j,m)/LQ(CE01) 
10" 1012 10'- 




DH02 _ 



10" 
L(8Mm)/L<, 



10 



lO'' 



o 



:10'2 - 



4 
o 



10' 



L,K(24/j,m)/Lg(CE01) 
10" 10'2 101' 




DH02 _ 



10' 

L(8Mm)/Lg 



Fig. 8. — Compai'ison, for z ~ 2 galaxies, of the total IR luminosity (Lir), or limits, as inferred from 70 ^m Spitzer photometiy to the 8^m rest-frame 
luminosity derived from MIPS 24fj,m. The left panel is for GOODS-N, while the right panel is for GOODS-S. The solid lines show the expected correlation 
based on the CEOl and DH02 templates. Black and red squares are for individual detections, symbols are as in Fig. |5] The green point show the results of 
stacking 70 fim undetected sources in 3 bins of 8 /im luminosity. Hard X-ray detected AGNs are excluded. 
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Fig. 9. — The comparison of SCUBA 850 /^m inferred Ljr for z ~ 2 
objects to the 24 fim based luminosities. For the 5 SMGs (red squares) we 
have photometric redshifts 2.2 < z< 2.8. The green points show stacking of 
850 fj-m undetected sources. Hard X-ray detected AGNs are excluded. 



[LiR(24/iTO) > 4 X 10 Lq], the average overestimate is about 
a factor of 3. It can be noticed from Fig. [T] that radio hmits 
move to higher Ljr in the right part of the plot. This corre- 
sponds to the fact that most of these putative HyLIRGs (from 
24 fim) tend to he at higher redshift than average. 
There are eight galaxies, spread over a range of mid-IR and 



radio luminosities, that on the contrary show a radio-derived 
bolometric luminosity that exceeds by a factor >, 3 (and up 
to ^ 30) that inferred from L(8/im). This could be due to 
additional radio emission from an AGN (despite non detec- 
tion at X-ray wavelengths), as suggested also by Donley et al. 
(2007), mostly in lower redshift sources. Some sources might 
have 24 /im deficit, as expected in the case of very low metal- 
licity (Engelbracht et al. 2005; Madden et al. 2006), or in the 
case of a strong 9.7 fim silicate absorption entering the 24 /im 
bandpass. The latter case can happen for z ^ 1.5, and we do 
find that 6 out of the 8 galaxies are indeed at 1 .2 < z < 1 .8. 

4.3. MIPS 24 fim versus MIPS 70 fim 

We have cross-checked this result using additional Ljr indi- 
cators. Fig. [8]panels show the comparison with luminosities 
inferred from the MIPS 70 /im flux densities from the sur- 
vey of Frayer et al. (2006; and in preparation). In the area 
with available deep 70 /im imaging from Spitzer, about 200 
arcmin^ from the two fields, there are only a handful of indi- 
vidually detected galaxies atz^tl .4. The 70 fj,m imaging has 
reached much shallower Lir limits than the 24 /im data; how- 
ever the non-detections are meaningful for the sources with 
higher Ljr as inferred from the 24 /im data and imply that 
Lir from 24 /im is overestimated in those sources. Similarly 
to radio, we have performed stacking of galaxies at 70 fim 
in both GOODS fields, binning the individually undetected 
galaxies as a function of their 8 /im rest-frame luminosities 
(see green boxes in Fig. [8]l. Again, the most notable feature 
is that sources with the brightest 8 /im rest-frame luminosities 
are not correspondingly brighter at 70 /im, as would be ex- 
pected on the basis of local templates. The average 70 /im to 
24 /im flux density ratio is ~ 6 — 7 for sources with L(8/im) ~ 
1O"L0, and of- 3 for those with L(8/im) > 2 x lO^Lg. Tem- 
plates from CEOl and DH02 predict instead that the 70 /im 
to 24 /im flux density ratio at z 2 should steeply increase 
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Fig. 10. — The 24 /im inferred luminosities are compared to the UV hght 
from star-formation reprocessed into the IR (SFRuv.obscured)- The latter is 
defined as the difference between the dust-con'ected and the uncorrected SFR 
estimated from the 1500A rest-frame luminosities. The blue points show 
the behavior of the median galaxy, for the usual 3 bins of 8fim rest-frame 
luminosity. Hard X-ray detected AGNs are excluded. 



with intrinsic Lir. A similar trend is found by Papovich by al. 
(2007) in z 2 galaxies. 

Notice that contrarily to the radio case, for a given Lir the 
70 fim to 24 /im flux density ratio is fairly independent on the 
redshift, for a given template, in the range considered includ- 
ing 2 < z < 3, due to more similar K-corrections. Therefore 
these conclusions are relatively robust against uncertainties 
coming from photometric redshifts. 

4.4. MIPS 24 fj.m versus SCUBA 850 

Finally, we have used the SCUBA super-map from Borys 
et al. (2003) in order to investigate the comparison of 850 /im 
versus 24 /zm inferred Lir values for z ^ 2 galaxies. Only 
5 galaxies in the sample are individually detected, all at the 
highest luminosities. For 4 of them the bolometric luminosi- 
ties inferred from the 24 /xm and 850 /im flux densities are in 
good agreement with expectations based on local templates, 
while in a single case the 24 /xm flux density lead to over- 
predicting LiR by over an order of magnitude compared to 
SCUBA. It is interesting to notice that this result is different 
from the findings by Pope et al. (2006) for 850 /im selected 
galaxies, where for typical galaxies the 850 /im flux density 
was in excess of what could have been predicted from the 
24 jim flux density. Our 5 sub-mm detected BzK galaxies rep- 
resents actually a small part of the Pope et al. (2006) sam- 
ple. The most likely explanation of the differences is that the 
sub-mm selection favors galaxies with relatively cold dust, as 
emphasized also in Pope et al. (2006). 

We stacked 850 /im undetected sources as a function of 
their 8 /im luminosities. We detect the intermediate and 
highest luminosity bins only, both with S/N=4 and 5850;^m ~ 
1.3mJy. This is consistent with the result presented in D05b 
and Knudsen et al. (2005) for SCUBA stacking of distant 



sources. This also confirms the result that for the most 
luminous 8 /im rest-frame galaxies, the 8 /im luminosity is in 
excess of what the local templates would predict given the Ljr 
of the galaxies estimated from other means. As to first order 
the 850 /im flux density is independent of redshifts for IR lu- 
minous galaxies within the probed redshift range, finding the 
same 850 /im for the two bins shown in Fig. 9 while their aver- 
age 8/im luminosity differers by a factor of ~ 4 testifies again 
for a large scatter in 8 jim to bolometric Lir luminosity ratio 
for high-z LIRGs and ULlRGs in our K-selected sample. 

5. SFR FROM UV LIGHT: 'TRANSPARENT' ULIRGS AT z ~ 2 

D05b found that the UV estimated SFRs, averaged over the 
full sample of BzK star forming galaxies with K < 20, was 
well within a factor of 2 of the average of the estimates de- 
rived from the radio, mid-lR, X-rays and sub-mm data. This 
suggested that -at least on average- UV is a a good tracer of 
star-formation for massive z ~2 galaxies, as advocated e.g., 
also by Reddy et al. (2005). We investigate here to which ex- 
tent this remains vaUd when considering individual galaxies. 

5.1. UV versus MIPS 24 iim 

Fig. [TOl shows the UV estimate of the obscured SFR as a 
function of the 8 /im rest-frame luminosities, for the GOODS- 
S field only where we reach much fainter star-forming galax- 
ies. This quantity is defined as: 

SFRuv .obscured 

= SFRuv.corr — SFRyV .uncoiTi (9) 

where the correction is for dust extinction, as described in 
Section 3.6. The term SFRuv^obscured is equivalent to SFRir in 
Eqs. 1 and 2. Moreover, entering with this quantity into Eq. 1 
and solving for Ljr one gets the expected infrared luminosity 
as due to the partial obscuration of the star formation and the 
reprocessing of its UV photons, i.e., Lin{UV), also plotted 
in Fig. [To] For this comparison we limit to sources detected 
at 24 /im and to galaxies where the {B — z)ab color is well 
estimated, with total error below 0.4 mag (corresponding to a 
maximum acceptable error of a factor of 2.5 in the reddening 
correction to the SFR). At low 8 /im rest-frame luminosities, 
there is a fairly good agreement between IR light (and SFR) 
estimated from UV and from the 24 /im flux density, which 
is within a factor of 2 for most galaxies. The two estimates 
start to diverge for L(8/im) 4 x lO'^L©, and by L(8/xm) > 
2 X lO'^L© the UV underestimates the IR light by more than 
a factor of ~ 6 compared to the mid-lR. However, given the 
discussion in the previous section, it is not clear how often 
this is due to an underestimate of the obscuration of the UV 
light, or to an excess emission at 24 /im. 

5.2. UV versus radio 

In order to further check the reliability UV-derived SFRs 
we compared UV-based estimates to radio observations. For 
radio detected sources (20% of the sample) we find that the 
median ratio between the IR luminosity estimated from the 
radio to that from the UV is about a factor of 2, as shown in 
Fig. [TT] This is likely the effect of the limiting flux density 
in the radio, such that only the sources with brightest con- 
tinuum are detected (i.e., the Malmquist bias). When stack- 
ing radio-undetected sources as a function of UV inferred 
Lir, and considering together radio-detected and undetected 
sources. Fig. [TTI shows that the ratio of UV to radio Lir's ap- 
pear to be very close to unity at all UV luminosities, within 
the errors. This implies that the UV-based SFR, corrected for 
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Fig. 1 1 . — The comparison of radio to UV estimated total IR luminosities. 
Green points show the stacking of radio undetected sources in the 3 bins de- 
fined by the dotted vertical lines. Blue points show the average of all galaxies 
(including detections) in the same 3 bins. Hard X-ray detected AGNs are 
excluded. 
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Fig. 12. — The ratio of mid-lR and radio inferred SFRs to UV inferred 
ones. Green points are the stacking of radio undetected sources in the 3 bins 
defined by dotted lines. Number of sources stacked in each bin is reported. 
Hard X-ray detected AGNs are excluded. 



dust reddening, is on average a good tracer of SFR even for 
the case of ULIRGs with Lir > lO^^^©. 

On the other hand, there are also evidences of extreme ob- 
scuration at UV wavelengths. For 2-3 sources in the sample 
the UV underestimates Ljr by a factor up to 10-30 (some of 
these objects might be radio loud AGN, despite their non de- 
tection in the X-ray). For the most luminous sources with 
LiR > 2 X lO'^ based on radio (a limit above which we should 
have a complete sample over GOODS-N) the median galaxy 
has UV underestimating Lir by a factor of 2.7. For the typi- 
cal (or median) ULIRG in our sample, however, the UV ap- 
pears to provide a fairly good estimate of Lir, with no sys- 
tematic trend/offset with respect to the radio estimates. This 
is quite different from local ULIRGs, where the UV is known 
to underestimate Lir by factors ^ 10 (Trentham et al. 1999; 
Goldader et al. 2002) even after correcting for dust extinction. 

Similarly, for z ^ 2 SMGs the UV underestimates SFRs 
by an average factor of 120 (Chapman et al. 2005). For the 
5 galaxies in GOODS-N to K = 20.5 that are detected by 
SCUBA at 850 /im, the 3 objects where the UV slope can 
be estimated with relatively good S/N the ratio of radio- to 
UV-derived SFR is in the range between 3 and 35. For the 
other two objects, with a poorer quality UV-derived SFR, this 
ratio is also ~ 30. As we are considering here a subsam- 
ple of SMGs with bright near-lR magnitudes, it is likely that 
we are limiting to the least obscured sources in their class, 
hence these results appear to be consistent with Chapman et 
al. (2005). 

All in all, there seems to be evidence that the typical z^2 
ULIRG is fairly 'transparent' to UV light, the word meaning 
that we can estimate its SFR given its rest-frame UV 1500A 
luminosity and UV slope. However, at much brighter bolo- 
metric luminosities, well in excess of IO'^Lq, the UV starts 
to saturate, as inferred from the brightest radio or sub-mm 



galaxies. There appears to be a shift of UV saturation level 
from about Ljr — 10^ 'Lq at z = to Lir greater than a few 
lO'^Lf: at z = 2. This is in line with the overall rise of the 
SFR density of the Universe during these epochs. 

Thus, it appears that the UV is on average performing well 
for the Lir <, lO'^L© z = 2 galaxies. However, for the dis- 
crepant cases with Lir (24 /im) ^ LiR(L'^y) (see again Fig.fTOll 
it remains unclear whether the discrepancy is due to excess 
obscuration or to an intrinsic 24 /im excess. To disentangle 
this ambiguity we have again used the radio as a diagnostic 
tool. In Fig. [12] we show the total SFR [from Eq. 2 using 
LiR(24/im) as input into Eq. 1] of the radio-derived SFRs, 
both normalized to the reddening corrected SFR from the UV. 
For the sources whose SFR derived from the 24 /^m flux den- 
sity exceeds by more of a factor of 3 that derived from the 
UV, we find that only ~ 10-15% of them have a similar or 
larger excess also in the radio (and thus could be represent 
ULIRGs with anomalously high obscuration). For the major- 
ity of 24 /im excess objects, however, we find again that radio 
SFRs agree with UV SFRs, suggesting again that it is 24 /im 
flux density to be genuinely in excess also with respect to the 
UV luminosity. 

5.3. Reliability of reddening correction and quiescent 
galaxies 

The discrepancies that have emerged from the comparisons 
in the previous sections lead to the question of how well we 
can correct UV luminosities for dust reddening, given the 
adopted reddening law, and if the mid-lR excess is somewhat 
related to dust reddening and UV slopes. Fig. [13] shows the 
ratio of "total" SFR as from Eq. 2 estimated from the mid- 
lR using the 24 /im flux densities over the UV-corrected SFR, 
as a function of the estimated reddening from Eq. 8. There 
appears to be no appreciable trend of a larger such ratio as a 
function of reddening. This implies that if excess obscuration 
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Fig . 13. — The ratio of SFR inferred from 24 /im (plus the UV uncorrected 
one) to the SFR derived from UV only and corrected for dust extinction using 
a Calzetti law, is plotted as a function of the reddening E(B-V) infen'ed from 
the UV slope (thi'ough the {B — z)ab color, as defined in D04b). We classify 
galaxies above the dashed horizontal line as mid-IR excess galaxies. The 
nature of such sources is investigated in detail in Paper II. 



is present over what corrected using the Calzetti et al. (2000) 
law, then the UV slope has to be very poorly correlated with 
such an excess obscuration. A grey obscuration would work, 
but all known dust absorbers are not grey. On the other hand, 
for the average galaxy, correcting the observed UV luminosi- 
ties based on the UV slope and using the Calzetti et al. red- 
dening law, allows us to obtain fairly robust SFR estimates, as 
indeed illustrated by Fig.[T3] 

There are a few galaxies where the reddening corrected SFR 
from the UV exceeds that derived from the mid-IR by more 
than a factor of 3. As opposed to 24 /im excess objects. Fig. [12] 
shows that in most of these cases it is the UV that is overes- 
timating SFRs. These latter ones are mostly confined to the 
sources with reddest colors, and are likely galaxies in a post- 
starburst phase that are on their way to turn passive/quiescent. 
In these cases, the redness of the UV continuum is due to a 
dearth of young/massive stars rather than to dust reddening, 
and the SFR inferred from the UV, corrected for dust extinc- 
tion following the recipes of Section 3.6, is obviously overes- 
timated. These sources are intermediate cases between pas- 
sive BzK galaxies and star-forming ones, and similar to the 
two blue systems discussed in Daddi et al. (2005a). The frac- 
tion of these systems is fairly small, implying a rapid tran- 
sition from the star-forming stage to the passive one. This 
is related to the possible presence inside sBzK samples of 
sources with passive or quiescent stellar populations, as em- 
phasized by Grazian, Nonino & Gallozzi (2007) and Quadri 
et al. (2007). The 25% discrepancy between average radio 
and UV estimates of Ljr for the most luminous UV galax- 
ies (see the blue points in Fig. fTTT i is completely solved by 
excluding these nearly quiescent sources (about 20% of the 
radio undetected galaxies in that bin). Clearly, MIPS 24 /im 
photometry is required to identify these Ukely post starburst 



galaxies, where the UV corrected for dust extinction provides 
an incorrect estimate of the ongoing SFR activity. 

5.4. The nature of mid-IR excess galaxies 

The results of the previous sections have clearly singled 
out the existence of a population of z ~ 2 galaxies that we 
call mid-IR excess galaxies, for which the MIPS 24 fim flux 
densities would imply IR bolometric luminosities and SFR in 
excees to those implied by all other available tracers of star 
formation activity. We emphasize that the existence of these 
mid-IR excess galaxies is independent on the classes of tem- 
plates used or on the particular recipe to convert an observed 
24 /im flux density at z ^ 2 into a measure of SFR/Ljr. The 
result shown in Figs. 7-10 can be read as evidence that at a 
fixed bolometric luminosity around Lir k, lO'^L© there is a 
very large scatter of L(8/im), and one can objectively define 
mid-IR excess galaxies as the galaxies with the largest ob- 
served L(8 /im) for a givel Ljr. 

Understanding and investigating the nature of these sources 
and the origin of the mid-IR excess has potentially far reach- 
ing implications. These aspects are directly developed in full 
in Paper II. In such paper we provide evidence indicating 
that mid-IR excess galaxies contain heavily obscured, often 
Compton Thick, AGNs. We then argue that the AGN ener- 
getic input is the most likely cause of the mid-IR excess in 
these galaxies. 

6. STAR FORMATION RATES IN z = 2 GALAXIES: A RECIPE 

We summarize the work described in the previous sections 
by suggesting a recipe for obtaining the best estimates of 
SFRs for samples of z ~ 2 galaxies, by exploiting a variety 
of multiwavelength SFR indicators as available in GOODS 
(and many of the current surveys, although to less deep levels, 
such as e.g., in COSMOS, Scoville et al. 2007). This proce- 
dure assumes that galaxies hosting relatively unobscured and 
luminous AGN have been eliminated from the sample, e.g,. 
by means of deep X-ray imaging. 

1) Compare UV-derived SFRs (S¥R{UV,corr)), obtained 
as described in Section 13.61 to the ones obtained from 
the 24 jim flux density, using the CEOl or DH02 tem- 
plates. For the sources where SFR.{mid — IR + UV) and 
SFR{UV,corr) agree within a factor of ~ 3 (Fig. [T2b. 
use the 24 /im based SFR plus the UV-based SFR un- 
corrected for reddening (i.e., SFR{mid — IR + UV)) as 
the best estimate for the galaxy SFR. These sources are 
called mid-IR normal star forming galaxies. 

2) For the sources where S>¥R{mid — IR + UV) is lower 
than about one third of SFR([/y, corr), use only the 
24/im based SFR (SFR(m/ii — IR)) as the best estimate. 
These sources are not likely to have the UV/optical light 
dominated by young massive stars, and are probably 
quiescent or post starbursts (Fig.[T2b. 

3) In the cases where S>¥K{mid — IR + UV) is larger than 
about 3 times SFR{UV,corr), i.e., for mid-IR excess 
galaxies, use SFR{UV,corr) as the best SFR indicator, 
as it is likely that there is a contribution to the 24 /im 
flux density from other than star formation. Availability 
of deep radio imaging and/or submm/mm imaging, can 
allow one to identify the subsample of galaxies among 
these that are actually opaque to UV light, and longer 
wavelength estimates of SFR are to be used in the latter. 
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Fig. 14. — The same of Fig. 10, but distinguishing K-bm\d faint (left panel) and bright (right panel) galaxies. A clear shift in the mean trend between mid-IR 
luminosity and SFR is detected. 



To our understanding these recipes makes the best possi- 
ble use of the multiwavelength database currently available, 
in order to cope with the limitations of each individual star 
formation indicator In particular, they are meant to single 
out those sources where a particular method is bound to fail, 
e.g., because the UV slope reflects age rather than reddening 
effects, or the mid-lR output is partly powered by an AGN. 

In principle, the similarity in the trends observed in Fig. 7- 
10 could suggest to define modified, highly non linear rela- 
tions between 8 fim luminosities and Lir for z — 2 galaxies, 
as suggested also by Papovich et al. (2007), for recovering 
corrected Lir for all galaxies. However, Fig.[T4]shows that in 
practice any such relation would be dependent on the sample 
properties itself, e.g. shifting as a function of /T-band magni- 
tude, and therefore not well defined in general. 

7. IMPLICATIONS 

In this section, we exploit the results presented so far, and in 
particular the improved SFRs for our sample of star-forming 
galaxies at z ~ 2, in order to address a number of issues 
which are relevant for our understanding of the nature of star- 
formation and mass assembly in such galaxies. 

7.1. The cosmic evolution of ULIRGs 

As already shown by D05b (see also Papovich et al. 2006; 
Reddy et al. 2006a; Caputi et al. 2007), the typical massive 
(M > 10' 'Mq yr"') star-forming galaxy at z = 2 is a ULIRG, 
i.e., it appear to have Lir ^ lO'^L© or more. Here, we pro- 
vide a more detailed estimate of the space density of z = 2 
ULIRGs using multiwavelength information. In GOODS- 
N there are 113 BzK galaxies that are classified as ULIRGs 
based on their 24 jjm flux densities, 58 based on UV SFRs, 
and 45 based on radio emission (or up to 69 if considering the 
undetected radio sources with limits still consistent with the 
ULIRG regime). Similarly, in GOODS-S there ai-e 155 BzKs, 
classified as ULIRGs from their 24 jim flux densities, and 
50 from the UV SFRs. At luminosities of Lir = lO'^L© the 



mid-lR starts to be systematically in excess of radio (Fig.|7]l, 
and therefore ULIRG densities derived from MIPS 24 jim 
data are likely to be overestimated. We notice however that 
the number of ULIRGs inferred from MIPS 24/im emission 
does not increase appreciably when moving from the sample 
with K < 20.5 in GOODS-N to the one Hmited a.t K < 22.0 
in GOODS-S. On the other hand, there can be some sources 
that are not classified as ULIRGs in the UV due to obscura- 
tion. Also, the radio flux density limits over this redshift range 
are very close to the ULIRG detection threshold, and hence a 
radio-based estimate could be somewhat incomplete. There- 
fore, we average the UV/radio and MIPS estimates to derive 
ULIRG space densities at z 2. Combining GOODS-N (154 
arcmin^) and GOODS-S (140 arcmin^ used in this work), 
and correcting the ULIRG counts to compensate for the fact 
that we are including only sources not blended in the IRAC 
images, we finally derive a sky density of ULIRGs of 0.6 
arcmin^^. Using the comoving volume within 1.4 < z < 2.5 
(Fig. lU, this implies a space density of 1.6 x 10^^ Mpc^^. 
The uncertainty in this density is order of 0.2 dex, when ac- 
counting for the scatter in the ULIRG counts from the differ- 
ent Lir tracers, as mentioned above. We note that we have 
excluded galaxies with hard X-ray detections from this anal- 
ysis as likely hosting AGNs. If we use the 24/im emission for 
the X-ray AGNs to estimate Lir, the sample of ULIRGs in- 
creases by 15-20%, thus still within the range of uncertainties 
estimated above. The space density of ULIRGs at z = 2 is a 
factor of 1000 larger than in the local Universe (e.g., Sanders 
er al. 2003), confirming the previous estimate of D05b. 

In a similar way, we derive the space density of ULIRGs in 
GOODS at z = 1, taking galaxies with 0.7 < z < 1.3 (using 
photometric and spectroscopic redshifts) and estimating Lir 
using 24 jim flux densities. We find only 7 such sources in 
the combined GOODS, which implies a space density of 2 x 
10^^, an order of magnitude lower than that at z = 2.'^ 

Adding hard X-ray detected galaxies only marginally increases the esti- 
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Fig. 16. — The number counts of passive galaxies is plotted versus K-band 
magnitude. The red points show the pBzK counts from the two fields of the 
large area survey of Kong et al. (2006). Predictions obtained from the mock 
lightcones based on the Millennium simulation (Kitzbichler & White 2007) 
are also shown, for galaxies with z > 1-4 and SFR < 1 and 10 Mgyr"' . 



In Fig. [15] we compare these results to predictions from the 
Millennium simulation of galaxy formation (Springel et al. 
2005). We use the lightcones based on semianalitic models 
presented by Kitzbichler & White (2007). For a Kroupa et 
al. (2001) or Chabrier (2003) IMF the ULIRG regime corre- 
sponds to a star-formation rate of 12OM0yr^', lower than the 
rate reported in Eq. 1 for a Salpeter IMF. We considered all 
simulated galaxies with SFR above this threshold as ULIRGs. 
We explicitly quote here results for the Kroupa (2001) and 
Chabrier (2003) IMFs because these are favored by observa- 
tions in the local Universe and therefore more appropriate for 
comparison to predictions of SFRs in galaxies. The model 
underpredicts the space density of ULIRGs at both z— I and 
z = 2 by at least an order of magnitude. Only at higher red- 
shifts, 3 < z < 5, do the Millennium models predict a ULIRG 
space density as large as that which we observe at z « 2. This 
suggests that star-formation in massive galaxies occurs too 
early in the simulations, or was terminated too early due to 
the adopted truncation of SFR from AGN feedback (e.g., Cro- 
ton et al. 2006). However, we reach similar conclusion when 
comparing to simulated galaxy catalogs based on the model 
of Oppenheimer & Dave (2006), which also predicts a com- 
parably large space density of ULIRGs only at z 2. 

7.2. Counts of passive galaxies 

One possible means for reconciling the models with the ob- 
servations would be to delay star-formation until later epochs. 
Therefore, it is important to compare observations and model 
predictions for the abundance of passive, non-star-forming 
galaxies at z ~ 2. In fact, the feedback from AGNs has been 

mated ULIRG space density also at z ~ 1 . 



implemented in the Millennium models in order to allow them 
to reproduce the existence of passive and massive galaxies 
both at high and low redshifts (de Lucia et al. 2006). The Op- 
penheimer & Dave (2006) models, lacking the crucial mech- 
anism of AGN feedback, predict very low space density of 
passive galaxies at z ~l-2, much below the observed density. 

In Fig. [16] we show the number counts of passive BzK 
ipBzK) galaxies from Kong et al. (2006). These objects, se- 
lected according to the criteria defined in D04b, are expected 
to provide a fair census of the population of z > 1 .4 passively 
evolving systems (see, e.g., Daddi et al 2005a; Arnouts et al. 
2007). In GOODS the B-band data are not deep enough to 
reliably distinguish passive from star-forming BzK galaxies 
using the color definition of D04b. However, we may approx- 
imate this by using the number counts of GOODS BzK galax- 
ies that have SFR < 10 Mpyr^^ based on the 24 //m measure- 
ments. As we have shown in Fig. [3] the MIPS 24 /im observa- 
tions are very sensitive for detecting low rates of star forma- 
tion, and in particular to identify galaxies with red UV slopes 
but relatively low SFRs (Fig.[T2l). The counts of galaxies with 
SFR < 10 Mgyr-i from the combined GOODS N&S fields 
are shown in Fig. [16] These are lower than the pBzK counts 
of Kong et al. (2006), most likely due to cosmic variance, as 
these sources are very strongly clustered (Kong et al. 2006). 
The larger COSMOS survey may provide more robust statis- 
tics on the abundance of passive galaxies, although we note 
that the 24 ^m data in COSMOS (Sanders et al. 2007) are 
shallower and cannot set such tight constraints on low SFRs 
atz w2. 

Overall, the Millennium simulations predictions, shown in 
Fig. [16] seem to reproduce the data reasonably well, given 
the large uncertainties, perhaps somewhat overestimating the 
counts of galaxies with relatively low SFRs. 
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7.3. The duty cycle of z ~ 2 ULIRGs 

The Millennium models might under-predict the ULIRG 
space density if the star formation episodes were short lived, 
so that the peak SFR is missed caught when averaging over 
the simulation time steps (250 Myr at z = 2 for the evolu- 
tion of the dark matter, with much finer steps for physical 
processes computed in the semi-analytic models). It is there- 
fore important to try to constrain the duration of the phase at 
high redshifts. This is also relevant in order to evaluate the 
contribution from this star-formation mode to the assembly 
of stars in massive galaxies. In D05b it was suggested that 
duty cycles are high, based on the very large detection rate 
and brightness of the most massive galaxies at 24 /im. Ca- 
puti et al. (2006) agree with this conclusion, based on their 
24 /im study of z = 2 galaxies in GOODS. We investigate this 
point in more detail here, in the light of our new results on the 
comparison of the different star formation estimators. 

In order to do this, we consider a mass- and volume-limited 
sample of BzK galaxies, and evaluate the fraction of ULIRGs 
in the sample. If ULIRGs stopped forming stars, they would 
remain in the mass-selected sample as passive or slowly star- 
forming galaxies. The fraction of ULIRGs in this mass- 
selected sample is therefore a lower limit to their duty cycle, 
as the ULIRG phase could have started well before the galax- 
ies reached the required mass threshold, or at an earlier era, 
before the redshift boundary of the sample considered here. 

To build a mass-limited sample of z ^ 2 galaxies, we con- 
sider all of the z > 1 .4 BzK galaxies, including both star form- 
ing and passive galaxies. The latter were excluded for the 
analyses in previous parts of the paper. We use the empiri- 
cal mass estimate described in D04b that is based on detailed 
SED fitting results with the Bruzual & Chariot (2003) models 
performed by Fontana et al. (2004). This calibration allows 
us to associate a mass to each galaxy based on its K-band 
magnitude, with a correction for varying M/L ratio based on 
the observed z — K color Maraston et al. (2006; 2007) have 
shown that for passive galaxies, where the overall SED is 
dominated by « 0.5-1 Gyr old stars, the use of the Maraston 
(2005) model libraries results in masses that are smaller by 
0.2-0.3 dex (see also Bruzual 2007). For star-forming BzK 
galaxies, where the young stellar component dominates the 
light, we find (Maraston et al., in preparation) that the Maras- 
ton models provide M/L ratios that are consistent on average 
with those from Bruzual & Chariot (2003). 

In the M > 10"M,:: sample constructed in this manner, we 
find 34 galaxies at z ~ 2 in GOODS-S and 38 in GOODS- 
N. Only 1/34 of the GOODS-S galaxies above this mass 
threshold has K > 20.5, which suggests that the sample is 
fairly complete also in the GOODS-N to this limit. The 
space density of massive galaxies with M > lO'^M© is about 
8 X 10^^ Mpc^^. This estimate is within a factor of « 2 of 
the one reported by van Dokkum et al. (2006) using the larger 
MUSIC survey, despite the different methods for estimating 
stellar masses. 

In order to estimate SFRs, we use both MIPS 24 /im and 
the UV luminosities corrected for reddening. While we have 
shown that SFRs inferred from 24 fj,m are suspect if they are 
substantially larger those derived from the UV data, the MIPS 
data are also very efficient for measuring relatively low SFRs 
in red galaxies where UV-reddening estimates would be in 
turn overestimated (see Fig. fT2]) . Therefore, the MIPS data 
are ideal for isolating the most quiescent galaxies even in the 
case that some low level residual SFRs place these sources in 



the star forming BzK region. 

By taking the minimum SFR for each object, estimated 
from either the 24 /im flux density or from the extinction- 
corrected UV luminosity, we find that the fraction of ULIRGs 
among the most massive galaxies at z ~ 2 is 39 ± 5% (Pois- 
son counting error only). This very conservative estimate sets 
a similar constraint on the duty cycle of the ULIRG phase in 
massive high-z galaxies, i.e., ^L40%. The cosmic time span 
elapsing during the redshift range 1 .4 < z < 2.5 is about 2 Gyr, 
with only 1 Gyr per galaxy available on average (given the 
reasonably flat redshift distribution within these limits). This 
implies that these ULIRG phase will last for at least some 
400 Myr, one order of magnitude longer than what is currently 
estimated for SMGs at similar redshifts, or for local ULIRGs 
(see, e.g., Greve et al. 2005; Tacconi et al. 2006; although see 
also Swinbank et al. 2006). 

7.4. The star formation mass correlation at z = 2 

We can gain further insights on the nature of star-formation 
at z = 2 by examining correlations between galaxy mass and 
star formation rates. Recently, Noeske et al. (2007) and El- 
baz et al. (2007) have shown that star formation and stellar 
mass define a tight correlation (0.2 dex scatter) in galaxies at 
z ^ 1, with rough proportionality (logarithmic slope of 0.9). 
Similar proportionality is also seen at z = in data from the 
Sloan Digital Sky Survey (Elbaz et al. 2007), although with 
a lower normalization reflecting the overall decline in cosmic 
SFR density with time. 

Fig.[T7](left panel) shows the SFR- stellar mass relation for 
galaxies at z ~ 2 in GOODS-S using the UV to derive SFRs. 
In this plot, we show only those sources with MIPS detec- 
tion, with 24 /im-derived SFR not less than 1/2 of the UV one 
(to avoid estimating SFR from UV for evolved galaxies, see 
Fig.fT2l) and with a well-defined UV slope for obtaining an ac- 
curate reddening estimate. We find that a proportionality with 
a logarithmic slope of 0.9 (blue line in Fig. [Tt] ). normalized 
to the median SFR/M* ratio of the sample (2.4 Gyr^'), pro- 
vides a good fit to the data, with semi interquartile range of 
only 0.16 dex in the SFR/M ratio. We have cross-checked the 
validity of this relation in GOODS-N using the deep radio ob- 
servations. We have divided the galaxies (limited to A' = 20.5, 
a smaller range than what plotted in Fig.fTT]) in 3 independent 
mass ranges and obtained the average SFR in the bin using the 
radio, averaging together radio detections and non detections. 
The green points in Fig.[T7]show that the star formation results 
from the stacked radio emission agree very well with the rela- 
tion that we derive from the UV light. This check is important 
because it is based on an indicator that should, in principle, be 
more solid than UV, and also because it allows us to include 
many of the massive galaxies for which UV SFR could not be 
estimated due to their faintness in the optical bands. 

Due to the substantial presence of mid-IR excess objects at 
z ~ 2, we do not expect that this relation can be accurately 
recovered using the 24 /im inferred SFRs. This explains why 
Caputi et al. (2006b) find a much looser correlation between 
SFR and masses for z = 2 galaxies. If we use 24 /im inferred 
SFR only for those galaxies where this quantity is within a 
factor of 3 of the UV one (the mid-IR normal objects), we 
obtain a SFR-mass correlation fully consistent with that based 
on UV (Fig.[T7l right panel). 

The z = 2 correlation appears to have a slope similar to that 
at lower redshifts. Instead, the normalization at z = 2 is a fac- 
tor of 3.7 larger than that at z = 1, and 27 times larger than at 
z = (Elbaz et al. 2007; Noeske et al. 2007). At fixed stellar 
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Fig. 17. — The Stellar Mass-SFR correlation for z = 2 star forming galaxies in GOODS. Points are taken from the deeper GOODS-S field to K = 22. We 
include only 24 fim detected galaxies: passive/quiescent galaxies are excluded from this analysis. Left panel (a) has SFRs derived from UV, corrected for dust 
extinction. Right panel (b) has 24 /im derived SFRs but we explicitly exclude all mid-IR excess galaxies. The large green dots are the result of average SFR-Mass 
relation in GOODS-N determined from radio stacking oi K < 20.5 galaxies in 3 mass bins. The blue fine is SFR= 200 x M";' [M^j yr"'], where Mu is the 
stellar mass in units of 10"M.,. The cyan solid lines are the z = 1 and z = correlations, taken from Elbaz et al. (2007), that have a similar slope of 0.9. The 
cyan dashed Hne is a prediction for z = 2 from the Millennium simulations, based on the mock lightcones of Kitzbichler & White (2007). The magenta star near 
the top shows the location of typical SMGs in this diagram. 
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Fig. 18. — The same of Fig. 1171 but here simulated galaxies at z = 
1.90 ±0.05 are plotted, taken from the Millennium lightcones of Kitzbich- 
ler & White (2007). The cyan line is a power law with slope of 0.65, and the 
blue solid line is the saine of the one in Fig. 1171 showing the average trend 
for the GOODS BzK sample at the same redshift. 



mass. Star forming galaxies were much more active on aver- 
age in the past. This is most likely due to a larger abundance 
of gas, depleted with passing of time. 



The inferred correlation is quite tight, with semi interquar- 
tile range of only 0.16 dex in the dispersion of specific SFRs. 
We caution that, having used mainly the UV as a SFR estima- 
tor, we cannot reliably rule out the presence of a larger number 
of outliers at low stellar masses, for which we might strongly 
underestimate the SFR from the UV. These cannot be reliably 
identified to meaningful depths with radio data (due to the flux 
density limits of current observations), nor at 24 fim (due to 
the existence of mid-lR excess sources). We will have to wait 
for Herschel and ALMA to address this issue accurately. 

Submillimeter-selected galaxies are strong outliers to this 
trend, however Tacconi et al. (2006) estimate that SMGs 
in their sample have typically Ljr — lO^^Lg, and dynamical 
masses ^ lO'^M© (see also Greve et al. 2005 for similar re- 
sults). For a given stellar mass (assuming that most of the dy- 
namical mass in the central regions of SMGs is stellar), SMGs 
are forming stars at a lOx or larger rate respect to ordinary 
massive star forming galaxies. Their space density is also ap- 
proximately an order of magnitude smaller. SMGs al z — 2 
appear to be like LlRGs and ULlRGs at z = 0, i.e., relatively 
rare objects and outliers of the mass-SFR correlation (see El- 
baz et al. 2007). To this regard, it is not surprising that much 
shorter star formation duty cycles and lifetimes have been in- 
fen-ed for SMGs and local ULlRGs (<C lOOMyr; Greve et al. 
2005; Solomon & Vanden Bout 2005). These represent short- 
lived stages of the life of galaxies, due, e.g., to ongoing merg- 
ers or some temporary perturbations (see, e.g., Dannerbaueret 
al. 2006 for a discussion). ULlRGs and SMGs also generally 
have smaller physical sizes (Tacconi et al. 2006) than those of 
more ordinary massive, star-forming galaxies at z = 2 (D04a; 
Ravindranath et al. 2007, in preparation), suggesting that the 
SMGs are in more advanced merger states. 

We have used the mock lightcones from Kitzbichler et 
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al. (2007), based on the Millennium simulations, to explore 
the comparison of mass and SFR at z = 2 in these models 
(Fig.fTSll. As emphasized already by Finlator et al. (2006) the- 
orethical simulations quite naturally predict the existence of 
correlations between galaxies SFRs and stellar masses. How- 
ever, we find that at fixed stellar masses, the model galax- 
ies are forming stars at about 1/4 of the observed rate for 
galaxies with M ^ 10^ 'M©. The correlation is also substan- 
tially tilted, with decreasing specific SFR at larger masses. It 
seems that a major change required for models would be to in- 
crease the star-formation efficiency (and thus the typical SFR) 
at all masses for star-forming galaxies at redshifts 0.8 < z < 3, 
while still keeping the current proportion of massive galaxies 
in a passive/quiescent state. Interestingly, we find that if we 
reproduce Fig. [TS] plotting simulated Millennium galaxies at 
z = 3 instead of z = 1 .9, we find that simulated galaxies match 
the z = 2 GOODS galaxies quite accurately, with the same 
SFR versus mass normalization, slope and with similar scat- 
ter. This again reinforces the idea that SFR and mass growth 
happens too early in the current version of the simulations. 

The observed mass-SFR correlation defines a more basic 
dichotomy of galaxy properties than those based on colors, 
as emphasized also by Elbaz et al. (2007). In future papers, 
we will investigate other physical properties of z = 2 galax- 
ies (like morphology) as a function of their distance from the 
correlation. Finally, we note that the trend of SFR increasing 
with galaxy mass, and the fact that star formation is sustained 
with long duty cycles, formally imply very rapid growth of 
the stellar mass during the period spanned by the observa- 
tions. We confirm the conclusions of D05b, that the epoch 
within 1 .4 < z < 2.5 corresponds to the major build-up period 
of the most massive galaxies in the local Universe. 

8. SUMMARY AND CONCLUSIONS 

We have investigated the properties of K-selected, star 
forming galaxies at z 2 in the GOODS fields, by means 
of over 1000 galaxies with \A 2.5, several hundred of 
which have measured spectroscopic redshifts. We have used 
the deepest observations at a variety of wavelengths, includ- 
ing radio, Spitzer 24 /im and 70 /im, SCUBA 850 /im and UV 
light, in order to derive and compare multiwavelength lumi- 
nosities of galaxies, and to investigate the reliability of these 
luminosities for measuring star formation rates. Our findings 
can be summarized as follow: 

• For most galaxies with moderate mid-IR luminosities, 
L(8/im) < 10^ 'Lq, Spitzer 24 /im measurements can be 
used to estimate total infrared luminosities and star for- 
mation rates of galaxies at z '--^ 2. The ratios of the 
mid-lR luminosities to those at other wavelengths (ra- 
dio, 70 /im, 850 /im and even reddening-corrected UV 
light) are consistent with expectations from local cor- 
relations, provided that their mid-IR luminosities is not 
too large. 

• However, most galaxies with L(8/im) > (1-2) x 
1O"L0 show a mid-IR excess with respect to all other 
luminosities, compared to that expected from local cor- 
relations, and on ratios observed in less luminous z = 2 
galaxies. We therefore report the discovery of a large 
population of z ~ 2 galaxies with intrinsic mid-IR ex- 
cess. We investigate the nature of these sources in 
a companion paper (Daddi et al. 2007; Paper 11 here- 
after), concluding that these objects contain heavily ob- 
scured, often Compton thick, AGN. 



• The presence of galaxies with seemingly extreme 
SFR > 1000 Mg) yr"' estimated from 24 /im flux den- 
sities is nearly always due to the existence of these 
mid-IR excess sources. A large proportion of 24 /im 
sources at z ^ 2 detected in Spitzer surveys shallower 
than GOODS will be mid-IR excess objects, and their 
star formation rates and infrared luminosities may be 
greatly overestimated if bolometric corrections from lo- 
cal starburst galaxy templates are applied. The true SFR 
of these objects rarely exceeds a few hundred Mg, yr^^ 

• For typical z = 2 galaxies, observed at fainter 24/im flux 
densities, the local correlations among the SFR tracers 
that we have probed here still hold within the uncertain- 
ties. 

• The observed UV luminosity, corrected for dust redden- 
ing based on the observed UV slope using a Calzetti 
et al. (2000) extinction law, allows one to obtain reli- 
able estimate of the SFR activity for a large majority 
of z = 2 galaxies. This is demonstrated in comparison 
with mid-IR estimates (except for sources with intrinsic 
mid-IR excess), and with radio, 70 /im and 850 /im ob- 
servations through stacking in bins of UV and mid-IR 
luminosity. The typical ULIRG at z = 2 is 'transpar- 
ent' to UV light, contrary to present-day ULlRGs and 
to z = 2 SMGs. 

• As exceptions, we do find a minority of z ~ 2 galax- 
ies which appear to be opaque to UV emission from 
star formation. Even after correction for dust redden- 
ing, the SFR derived from the UV light in these ob- 
jects is underestimated by large amounts. These ob- 
jects are similar to SMGs, and given the depths of cur- 
rent GOODS multiwavelength data, they are best rec- 
ognized as highly obscured outliers by comparing their 
UV and radio properties. 

• Similarly, there is a small population of K-selected 
sources that are classified as star forming galaxies based 
on optical-UV colors (e.g., with the BzK diagram) but 
for which the UV estimates of SFR greatly overpre- 
dict the true amount of SFR. These are probably post- 
starburst galaxies, whose red UV colors are not only 
due to dust reddening, but also have important contribu- 
tions from intrinsically cooler, lower-mass stars. Also 
these objects can be identified from the comparison of 
UV and mid-IR based SFRs. 

• Exploiting all available z = 2 SFR estimators, we have 
derived a measure of the space density of ULlRGs at 
z 2 in the GOODS fields. We find a space den- 
sity of about 2 X lO^'* Mpc^^ and a sky density of 0.6 
arcmin^^ at 1.4 < z < 2.5. This is a factor of 10 larger 
than the abundance predicted by semianalitic models 
based on the Millennium simulations. 

• Similarly, we have used our improved understanding 
of z = 2 SFR tracers to investigate the abundance of 
z ^ 2 massive galaxies with low ongoing SFR (below 
10 Mq-, yr^'). We have computed number counts of 
these galaxies as a function of their K-band magnitude, 
and find that the counts are lower than that of passive 
ipBzK) objects selected by Kong et al. (2006), likely 
due to cosmic variance and possibly to some contam- 
ination of pBzK samples from z < 1.4 dusty galaxies. 
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Predictions based on the Millennium simulations can 
reasonably well account for the space density of these 
galaxies. 

• We have built a mass-limited sample of galaxies with 
M > 1O"M0 at z ^ 2. By estimating the distribution 
of SFRs in these galaxies, we derive lower limits to the 
duty cycle of ULlRGs. This is defined as the fraction of 
ULlRGs inside the mass-limited sample. We conclude 
that this duty cycle is at least 0.4. This is much larger 
than the duty cycle estimated for local ULlRGs, and 
also for z = 2 SMGs. The relatively large duty cycles 
imply SFR duration at ULIRG levels of order of 0.5 Gyr 
or more. This also implies that large gas reservoirs have 
to exist in most of the massive z — 2 galaxies, or that gas 
must be accreted very efficiently over time. 

• We have found that a relatively tight stellar mass-SFR 
relation is akeady in place at z = 2, for star form- 
ing galaxies detected at 24/j,m, with a scatter (semi- 
interquartile range) of only 0.16 dex in the specific star 
formation rate. This has been derived mainly basing on 
the UV estimates of SFRs, and confirmed through radio 
stacking of galaxies in mass bins and using SFRs de- 
rived from the 24 /im flux density, once mid-lR excess 
objects are excluded from the sample. The typical star 
forming galaxy at z = 2 is forming stars more rapidly 
by factors of 3.7 and 27 respect to an object with sim- 
ilar mass at z = 1 and z = 0. Current numerical simu- 
lations are able to reproduce the mass-SFR correlation 



Alexander, D. M., et al. 2003, AJ, 125, 383 

Armus L., Charmandaris V., Bernard-Salas J., et al., 2006, |astro-ph/0610218| 
Arnouts S., Walcher C.J., Le Fevre O., et al., 2007, submitted to A&A 

(arXiv:0705.2438l 
B017S C, Chapman S., Halpern M., Scott D., 2003, MNRAS, 344, 385 
Bruzual, G. & Chariot, S. 2003, MNRAS, 344, 1000 

Bruzual G., 2007, Proceedings of the Meeting "From Stars to Galaxies: 
Building the Pieces to Build Up the Universe", eds. A. Vallenari, 
R. Tantalo, L. Portinari, and A. Moretti. ASP Conf. Ser (in press) 
(astro-ph/0702091 1 

Buat v.. et al., 2005, ApJ, 619, L51 

Caput! K.I., et al., 2006a, ApJ, 637, 727 

Caput! K.I., et al., 2006b, ApJ, 454, 143 

Caput! K.I., et al., 2007, ApJ, in press jastro-ph/0701283 I 

Calzett! D., Armus L., Bohlin R. C, et al., 2000, ApJ, 533, 682 

Chabrier G., 2003, ApJ, 586, L133 

Chapman, S. C, Blain, A. W., Smail, I., & Ivison. R. J. 2005, ApJ, 622, 772 

Chary R., Elbaz D., 2001, ApJ, 556, 562 (CEOl) 

Coleman, G. D., Wu, C.-C, & Weedman, D. W. 1980, ApJS, 43, 393 

Condon J. J., 1992, ARA&A, 30, 575 

Croton D. J., et al., 2006, MNRAS, 365, 1 1 

Daddi E., Cimatti A., Renzini A., et al., 2004a, ApJ, 600, L127 (D04a) 
Daddi E., Cimatti A.. Renzini A., et al., 2004b, ApJ, 617, 746 (D04b) 
Daddi E., et al., 2005a, ApJ, 626, 680 (D05a) 

Daddi E., Dickinson M., Chary R., et al, 2005b, ApJ, 631, L13 (D05b) 
Daddi E., Alexander D.M., Dickinson M., et al., 2007, ApJ in press 

I arXiv:0705.2832 I (Paper II) 
Dale D.A., Helou G., 2002, ApJ, 576, 159 (DH02) 
Dale D.A., et al., 2005, ApJ 633 857 
Dannerbauer H., et al., 2006, ApJ, 637, L5 

De Lucia G., Springel V., White S. D. M., Croton D., Kauffmann G., 2006, 
MNRAS, 366, 499 

Dickinson M., Papovich C, Ferguson H. C, Budavari T., 2003, ApJ, 587, 25 

Elbaz D., Cesarsky C.J., Chanial P, et al., 2002, A&A, 384, 848 

Elbaz D., Daddi E., Le Borgne D., et al., 2007, A&A in press 

I astro-ph/0703653 I 
Engelbracht C. W., et al., 2005, ApJ, 642, L127 
Finlator K., Dave R., Papovich C, Hernquist L., 2006, ApJ, 639, 672 



with similar slope and similarly small scatter to what 
observed, but with much lower normaUzation than the 
observed one. 

• We conclude by suggesting that current realizations 
of galaxy formation models could come to fairly good 
agreement with the observations available for z = 2 
galaxies if, by some means, the major periods of star 
formation activity could be delayed until later epochs 
than in current implementations. 
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